Abstract

We present a new technique for proving the empirical process invariance principle for stationary processes $(X_n)_{n\ge 0}$. The main novelty of our approach lies in the fact that we only require the central limit theorem and a moment bound for a restricted class of functions $(f(X_n))_{n\ge 0}$, not containing the indicator functions. Our approach can be applied to Markov chains and dynamical systems, using spectral properties of the transfer operator. Our proof consists of a novel application of chaining techniques.

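1 Introduction

Let $(X_n)_{n\ge 0}$ be a stationary ergodic process of $\mathbb{R}$-valued random variables with marginal distribution function $F(t) = P(X_0 \le t)$. Define the empirical distribution function $(F_n(t))_{t\in\mathbb{R}}$ and the empirical process $(U_n(t))_{t\in\mathbb{R}}$ by

$$F_n(t) := \frac{1}{n}\sum_{i=1}^{n} 1_{(-\infty,t]}(X_i), \quad t\in\mathbb{R},$$

$$U_n(t) := \sqrt{n}\,(F_n(t)-F(t)), \quad t\in\mathbb{R}.$$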



The empirical process plays a prominent role in non-parametric statistical inference 
about the distribution function F. In all statistical applications, information about 
the distribution of the empirical process is needed. 
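
As a purely illustrative aid, the two definitions above can be evaluated numerically. The following is a minimal sketch, assuming the marginal distribution function F is known in closed form; the helper names `empirical_cdf` and `empirical_process` and the uniform toy data are hypothetical and not part of the paper.

```python
import bisect
import math
import random


def empirical_cdf(sample):
    """Return F_n as a function t -> (1/n) * #{i : X_i <= t}."""
    xs = sorted(sample)
    n = len(xs)

    def F_n(t):
        # bisect_right counts the observations that are <= t.
        return bisect.bisect_right(xs, t) / n

    return F_n


def empirical_process(sample, F):
    """Return U_n(t) = sqrt(n) * (F_n(t) - F(t)) for a known marginal F."""
    F_n = empirical_cdf(sample)
    root_n = math.sqrt(len(sample))
    return lambda t: root_n * (F_n(t) - F(t))


if __name__ == "__main__":
    rng = random.Random(0)
    sample = [rng.random() for _ in range(1000)]  # i.i.d. uniform toy data
    U_n = empirical_process(sample, lambda t: min(max(t, 0.0), 1.0))
    print([round(U_n(t), 3) for t in (0.25, 0.5, 0.75)])
```

The toy data here are independent; for dependent data only the sampling step would change, while the two functions stay the same.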

In the case of i.i.d. observations, Donsker [7] proved in 1952 that the empirical process converges in distribution to a Brownian bridge process, thus confirming an earlier conjecture of Doob [8]. In 1968, Billingsley [2] extended Donsker's theorem to some weakly dependent processes, specifically to functionals of $\phi$-mixing processes. One of the applications of Billingsley's theorem is to the empirical process of data generated by the continued fraction dynamical system $T : [0,1] \to [0,1]$, $T(x) := \frac{1}{x} \bmod 1$. Since 1968, many authors have studied the empirical process of weakly dependent data. Invariance principles for the empirical distribution of strong mixing random variables were proved in 1977 by Berkes and Philipp [1] and in 1980 for the multivariate case by Philipp and Pinzur [19]. Later, absolutely regular processes were studied by Doukhan et al. [9] and Borovkova et al. [3]. Many other weak dependence conditions have been studied, see Doukhan and Louhichi [10], Prieur [20], Dedecker and Prieur [6], Wu and Shao [22], Wu [21]. From the point of view of dynamical systems, an empirical process invariance principle for expanding maps of the interval was proved by Collet et al. [5]. Another one for ergodic torus automorphisms was proved by Durieu and Jouan [14].
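
For readers unfamiliar with the continued fraction dynamical system mentioned above, here is a minimal sketch in its standard form $T(x) = 1/x \bmod 1$ (with the convention $T(0) = 0$, an assumption made only for this illustration). The function names are hypothetical, and floating-point orbits are only reliable for a few iterations.

```python
import math


def gauss_map(x):
    """Continued fraction map T(x) = 1/x mod 1 on (0, 1], with T(0) = 0."""
    if x == 0.0:
        return 0.0
    y = 1.0 / x
    return y - math.floor(y)


def continued_fraction_digits(x, count):
    """First partial quotients a_1, a_2, ... of x in (0, 1), at most `count`."""
    digits = []
    for _ in range(count):
        if x == 0.0:
            break  # x was rational and its expansion has terminated
        digits.append(math.floor(1.0 / x))
        x = gauss_map(x)
    return digits


if __name__ == "__main__":
    golden = (math.sqrt(5.0) - 1.0) / 2.0
    print(continued_fraction_digits(golden, 5))
```

The orbit $x, T(x), T^2(x), \dots$ is the kind of data sequence to which the empirical process results discussed in this paragraph are applied.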

Proofs of empirical process invariance principles usually consist of two parts, establishing finite-dimensional convergence and tightness of the empirical process. Finite-dimensional convergence, i.e. convergence in distribution of the sequence of vectors $(U_n(t_1),\dots,U_n(t_k))_{n\ge 1}$, is an immediate consequence of the multivariate CLT for partial sums of the process

$$(1_{(-\infty,t_1]}(X_n),\dots,1_{(-\infty,t_k]}(X_n))_{n\ge 1}.$$

Tightness is far more difficult to establish. One ingredient is usually a probability bound on the increments of the empirical process

$$U_n(t) - U_n(s) = \frac{1}{\sqrt{n}}\sum_{i=1}^{n}\{1_{(s,t]}(X_i) - (F(t)-F(s))\},$$

for a fixed pair $s < t$. Such bounds can in the simplest approach be obtained from bounds on the 4-th moments of $U_n(t) - U_n(s)$. Other results require higher order moment bounds or even exponential bounds.

The traditional approach to empirical process invariance principles, as outlined above, works well in situations when the sequence of indicator variables $(1_{(s,t]}(X_n))_{n\ge 0}$ inherits good properties from the original process $(X_n)_{n\ge 0}$. This holds, for example, when $(X_n)_{n\ge 0}$ is strong (uniform, beta) mixing, because then $(1_{(s,t]}(X_n))_{n\ge 0}$ has the same property. There are, however, situations where this is not the case or at least not easy to establish. For some types of Markov processes and dynamical systems, see e.g. Hennion and Herve [17], one has good control over the properties of $(f(X_n))_{n\ge 0}$ when $f$ is a Lipschitz function, but not for indicator functions. For example, Gouezel [16] gave a uniformly expanding map of the interval which has a spectral gap on the space of Lipschitz functions but not on the space of bounded variation functions. In this paper, we develop an approach that is strictly based on properties of Lipschitz functions $f(X_i)$ of the original data. We make two basic assumptions, namely that the partial sums of Lipschitz functions satisfy the CLT and that a suitable 4-th moment bound is satisfied.



For our proof we develop a variant of the classical chaining technique that uses only Lipschitz functions at all stages of the chaining argument. We replace the usual finite-dimensional convergence plus tightness approach by a method of approximation by a sequence of finite-dimensional processes, which are different from the coordinate projections $(U_n(t_1),\dots,U_n(t_k))$. We show convergence in distribution of the finite-dimensional processes and prove that the finite-dimensional process approximates the empirical process. In the final step, we use an improved version of a theorem of Billingsley [2], see our Theorem 2 below, to establish convergence in distribution of the empirical process.

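In the present paper, we make two assumptions concerning the process $(X_i)_{i\ge 0}$:

1. For any Lipschitz function $f$, the CLT holds, i.e.

$$\frac{1}{\sqrt{n}}\sum_{i=1}^{n}\big(f(X_i) - Ef(X_0)\big) \xrightarrow{\mathcal{D}} N(0,\sigma^2), \qquad (1.1)$$

where $N(0,\sigma^2)$ denotes a normal law with mean zero and variance

$$\sigma^2 = E\big(f(X_0) - Ef(X_0)\big)^2 + 2\sum_{i=1}^{\infty}\mathrm{Cov}(f(X_0), f(X_i)).$$

2. A bound on the 4-th central moments of partial sums of $(f(X_i))_{i\ge 0}$, $f$ bounded Lipschitz with $E(f(X_0)) = 0$, of the type

$$E\Big(\sum_{i=1}^{n} f(X_i)\Big)^4 \le C\,m_f^3\Big(n\,\|f(X_0)\|_1 \log^{\alpha}(1+\|f\|) + n^2\,\|f(X_0)\|_1^2 \log^{\beta}(1+\|f\|)\Big), \qquad (1.2)$$

where $C$ is some universal constant, $\alpha$ and $\beta$ are some nonnegative integers,

$$\|f\| = \sup_{x}|f(x)| + \sup_{x\ne y}\frac{|f(x)-f(y)|}{|x-y|}$$

and

$$m_f = \max\{1, \sup_{x}|f(x)|\}.$$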
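
To make the quantities $\|f\|$ and $m_f$ concrete, the following minimal sketch approximates them on a finite grid. It is an illustration only: the function names are hypothetical, and a grid evaluation gives a lower bound for the true suprema rather than their exact values.

```python
def lipschitz_norm(f, grid):
    """Grid approximation of ||f|| = sup|f| + sup_{x != y} |f(x)-f(y)|/|x-y|.

    `grid` must be sorted and strictly increasing.
    """
    values = [f(x) for x in grid]
    sup_abs = max(abs(v) for v in values)
    # On a sorted grid of real numbers, any difference quotient is a weighted
    # average of adjacent ones, so adjacent pairs attain the maximum.
    slopes = [
        abs(values[i + 1] - values[i]) / (grid[i + 1] - grid[i])
        for i in range(len(grid) - 1)
    ]
    return sup_abs + max(slopes)


def m_f(f, grid):
    """Grid approximation of m_f = max{1, sup|f|}."""
    return max(1.0, max(abs(f(x)) for x in grid))


if __name__ == "__main__":
    grid = [i / 100 for i in range(-200, 201)]
    tent = lambda x: max(0.0, 1.0 - abs(x))  # sup = 1, Lipschitz constant = 1
    print(lipschitz_norm(tent, grid), m_f(tent, grid))
```

For functions bounded by 1, such as the Lipschitz approximations of indicators used later in the proof, $m_f = 1$ and the factor $m_f^3$ plays no role.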

Remark 1.1 These assumptions can be verified for a large class of Markov chains and dynamical systems. Concerning the CLT for Lipschitz functions, many results can be found in the literature; see e.g. Hennion and Herve [17] for the spectral gap method and Bradley [4] for the mixing approach. Durieu [13] has established 4-th moment bounds of the type (1.2) for Markov chains and dynamical systems under spectral properties. For more details and concrete examples see Section 4 of this paper.

We shall assume some regularity for the distribution function of $X_0$. We define the modulus of continuity of a function $f : \mathbb{R} \to \mathbb{R}$ by

$$\omega_f(\delta) = \sup\{|f(s)-f(t)| : s,t\in\mathbb{R},\ |s-t|\le\delta\}.$$
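
The modulus of continuity can be approximated numerically on a grid. The sketch below is an illustration under stated assumptions: the names `modulus_of_continuity` and `log_modulus_bound` are hypothetical, the grid value is only a lower bound for the true supremum, and the second helper merely evaluates a logarithmic bound of the form $D|\log\delta|^{-\gamma}$ for $0 < \delta < 1$.

```python
import math


def modulus_of_continuity(f, grid, delta):
    """Grid approximation of w_f(delta) = sup{|f(s) - f(t)| : |s - t| <= delta}."""
    xs = sorted(grid)
    values = [f(x) for x in xs]
    best = 0.0
    for i in range(len(xs)):
        j = i + 1
        # Scan all grid points within distance delta to the right of xs[i].
        while j < len(xs) and xs[j] - xs[i] <= delta:
            best = max(best, abs(values[j] - values[i]))
            j += 1
    return best


def log_modulus_bound(delta, D, gamma):
    """Evaluate D * |log(delta)|^(-gamma) for 0 < delta < 1."""
    return D * abs(math.log(delta)) ** (-gamma)


if __name__ == "__main__":
    grid = [i / 8 for i in range(9)]
    print(modulus_of_continuity(math.sqrt, grid, 0.25))
    print(log_modulus_bound(0.25, 2.0, 3.0))
```

A Hölder-continuous function such as the square root has a modulus that decays polynomially in $\delta$, which is much faster than any logarithmic rate.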
We can now state our main result. 



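Theorem 1 Let $(X_i)_{i\ge 0}$ be an $\mathbb{R}$-valued stationary ergodic random process such that the conditions (1.1) and (1.2) hold. Assume that $X_0$ has a distribution function $F$ satisfying the following condition,

$$\omega_F(\delta) \le D\,|\log(\delta)|^{-\gamma} \quad \text{for some } D > 0 \text{ and } \gamma > \max\{\tfrac{\alpha}{2}, \beta\}, \qquad (1.3)$$

then

$$(U_n(t))_{t\in\mathbb{R}} \xrightarrow{\mathcal{D}} (W(t))_{t\in\mathbb{R}},$$

where $W(t)$ is a mean-zero Gaussian process with covariances

$$E\,W(s)\cdot W(t) = \mathrm{Cov}(1_{(-\infty,s]}(X_0), 1_{(-\infty,t]}(X_0)) + \sum_{k=1}^{\infty}\mathrm{Cov}(1_{(-\infty,s]}(X_0), 1_{(-\infty,t]}(X_k)) + \sum_{k=1}^{\infty}\mathrm{Cov}(1_{(-\infty,s]}(X_k), 1_{(-\infty,t]}(X_0)).$$

Further, almost surely, $(W(t))_{t\in\mathbb{R}}$ has continuous sample paths.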

Remark 1.2 In particular, if $X_0$ has a Hölder-continuous distribution function then (1.3) holds.

Remark 1.3 If the $X_i$'s are i.i.d., $(W(t))_{t\in\mathbb{R}}$ is a Brownian bridge, but this is not the case for dependent variables, as in Billingsley [2] or Collet et al. [5].



In order to prove Theorem 1, we apply the following theorem, which is a stronger version of Theorem 4.2 of Billingsley [2] in the complete case. We do not need to assume a priori that $X^{(m)}$ has a limit in distribution.

Theorem 2 Let $(S,\rho)$ be a complete separable metric space and let $X_n$, $X_n^{(m)}$ and $X^{(m)}$, $n,m\ge 1$ be $S$-valued random variables satisfying

$$X_n^{(m)} \xrightarrow{\mathcal{D}} X^{(m)} \quad \text{as } n\to\infty,\ \forall m, \qquad (1.4)$$

$$\lim_{m\to\infty}\limsup_{n\to\infty} P(\rho(X_n, X_n^{(m)}) \ge \varepsilon) = 0, \quad \forall\varepsilon > 0. \qquad (1.5)$$

Then there exists an $S$-valued random variable $X$ such that

$$X_n \xrightarrow{\mathcal{D}} X \quad \text{as } n\to\infty.$$

Moreover $X^{(m)} \xrightarrow{\mathcal{D}} X$ as $m\to\infty$.

Theorem 1 is proved in Section 2 and Theorem 2 in Section 3.

2 Proof of Theorem 1
2.1 The bounded case

We first prove the result for bounded variables. Let $(X_i)_{i\ge 0}$ be a $[0,1]$-valued stationary ergodic random process such that (1.1), (1.2) and (1.3) hold.

In our approach we work with Lipschitz continuous approximations to the indicator functions $1_{(-\infty,t]}(x)$. Given a partition

$$0 = t'_0 < \dots < t'_m = 1$$

we define

$$t_j = F^{-1}(t'_j)$$

where $F^{-1}$ is given by

$$F^{-1}(t) = \sup\{s\in[0,1] : F(s)\le t\}.$$

Thus, by continuity of $F$, we have a partition

$$0 \le t_0 < \dots < t_m = 1.$$

We introduce the functions $\varphi_j : [0,1]\to\mathbb{R}$ by

$$\varphi_j(x) = \varphi\Big(\frac{x - t_{j-1}}{t_{j-1} - t_{j-2}}\Big), \quad \text{for } j = 2,\dots,m,$$

where

$$\varphi(x) = 1_{(-\infty,-1]}(x) - x\,1_{(-1,0]}(x) \qquad (2.1)$$

and $\varphi_1 \equiv 0$.
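
The functions just introduced are simple enough to write down directly. The following minimal sketch implements $\varphi$ from (2.1) and builds $\varphi_j$ from a given partition; the name `make_phi_j` and the list representation `t[0], ..., t[m]` of the partition are assumptions made for this illustration.

```python
def phi(x):
    """phi(x) = 1 for x <= -1, -x for -1 < x <= 0, and 0 for x > 0."""
    if x <= -1.0:
        return 1.0
    if x < 0.0:
        return -x
    return 0.0


def make_phi_j(t, j):
    """Return phi_j for the partition t[0] < ... < t[m]; phi_1 is identically 0."""
    if j == 1:
        return lambda x: 0.0
    left, right = t[j - 2], t[j - 1]
    # phi_j equals 1 up to t_{j-2}, decreases linearly, and vanishes from t_{j-1} on.
    return lambda x: phi((x - right) / (right - left))


if __name__ == "__main__":
    t = [0.0, 0.25, 0.5, 0.75, 1.0]
    phi_3 = make_phi_j(t, 3)
    print([phi_3(x) for x in (0.1, 0.25, 0.375, 0.5, 0.9)])
```

By construction $1_{(-\infty,t_{j-2}]} \le \varphi_j \le 1_{(-\infty,t_{j-1}]}$, and the Lipschitz constant of $\varphi_j$ is $1/(t_{j-1}-t_{j-2})$, which is where the regularity of $F$ enters.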

The function $\varphi_j$ will serve as a Lipschitz-continuous approximation to the indicator function $1_{(-\infty,t_{j-1}]}(x)$. Note that $\varphi_j(x)$ depends on the partition, not only on the point $t_{j-1}$. We now define the process

$$F_n^{(m)}(t) := \frac{1}{n}\sum_{i=1}^{n}\sum_{j=1}^{m}\varphi_j(X_i)\,1_{[t_{j-1},t_j)}(t) = \sum_{j=1}^{m}\Big(\frac{1}{n}\sum_{i=1}^{n}\varphi_j(X_i)\Big)1_{[t_{j-1},t_j)}(t).$$
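Note that $F_n^{(m)}(t)$ is a piecewise constant approximation to the empirical distribution function $F_n(t)$. For $t\in[t_{j-1},t_j]$, we have the inequality

$$F_n(t_{j-2}) \le F_n^{(m)}(t) \le F_n(t).$$

We define further

$$F^{(m)}(t) := E\big(F_n^{(m)}(t)\big) = \sum_{j=1}^{m} E(\varphi_j(X_0))\,1_{[t_{j-1},t_j)}(t)$$

and finally the centered and normalized process

$$U_n^{(m)}(t) = \sqrt{n}\,\big(F_n^{(m)}(t) - F^{(m)}(t)\big). \qquad (2.2)$$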

Our proof of Theorem 1 now consists of two parts, each of which will be formulated separately as a proposition below. The theorem will follow by application of Theorem 2 where $(S,\rho)$ is the space of càdlàg functions $D[0,1]$ provided with the Skorohod topology and the metric $d_0$; see Billingsley [2], p. 113. Note that $(D[0,1], d_0)$ is a complete separable metric space.

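Proposition 2.1 For any partition $0 = t'_0 < \dots < t'_m = 1$, there exists a piecewise constant Gaussian process $(W^{(m)}(t))_{0\le t\le 1}$ such that

$$(U_n^{(m)}(t))_{0\le t\le 1} \xrightarrow{\mathcal{D}} (W^{(m)}(t))_{0\le t\le 1}.$$

The sample paths of the processes $(W^{(m)}(t))_{0\le t\le 1}$ are constant on each of the intervals $[t_{j-1},t_j)$, $1\le j\le m$, and $W^{(m)}(0) = 0$. The vector $(W^{(m)}(t_1),\dots,W^{(m)}(t_m))$ has a multivariate normal distribution with mean zero and covariances

$$\mathrm{Cov}(W^{(m)}(t_{i-1}), W^{(m)}(t_{j-1})) = \mathrm{Cov}(\varphi_i(X_0),\varphi_j(X_0)) + \sum_{k=1}^{\infty}\mathrm{Cov}(\varphi_i(X_0),\varphi_j(X_k)) + \sum_{k=1}^{\infty}\mathrm{Cov}(\varphi_i(X_k),\varphi_j(X_0)).$$
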

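Proof.

Using (1.1) and the Cramer-Wold device, we can show that for any Lipschitz functions $f_1,\dots,f_k$, the multivariate CLT holds, i.e.

$$\frac{1}{\sqrt{n}}\sum_{i=1}^{n}\big\{(f_1(X_i),\dots,f_k(X_i)) - E(f_1(X_0),\dots,f_k(X_0))\big\} \xrightarrow{\mathcal{D}} N(0,\Sigma_{f_1,\dots,f_k}),$$

where $N(0,\Sigma_{f_1,\dots,f_k})$ denotes a multivariate normal law with mean zero and covariance matrix

$$\Sigma_{f_1,\dots,f_k} = (\sigma_{f_i,f_j})_{1\le i,j\le k}$$

where for any Lipschitz functions $f$, $g$ we define

$$\sigma_{f,g} = \mathrm{Cov}(f(X_0),g(X_0)) + \sum_{k=1}^{\infty}\mathrm{Cov}(f(X_0),g(X_k)) + \sum_{k=1}^{\infty}\mathrm{Cov}(f(X_k),g(X_0)).$$

Applied to the Lipschitz functions $\varphi_1,\dots,\varphi_m$, this result proves the proposition. □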

Proposition 2.2 For any $\varepsilon,\eta > 0$ there exists a partition $0 = t'_0 < \dots < t'_m = 1$ such that

$$\limsup_{n\to\infty} P\Big(\sup_{0\le t\le 1}|U_n(t) - U_n^{(m)}(t)| > \varepsilon\Big) \le \eta.$$



Proof.

By a variant of the well known chaining technique we will control

$$P\Big(\sup_{0\le t\le 1}|U_n(t) - U_n^{(m)}(t)| > \varepsilon\Big),$$

and then show that this probability can be made arbitrarily small by choosing a partition $0 = t'_0 < \dots < t'_m = 1$ that is fine enough. From here on we assume that the partition $0 = t'_0 < \dots < t'_m = 1$ is regularly distributed. Let $h = \frac{1}{m} = t'_j - t'_{j-1}$, for $j = 1,\dots,m$.

On the interval $[t'_{j-1},t'_j]$ we introduce a sequence of refining partitions

$$t'_{j-1} = {s'}_0^{(k)} < {s'}_1^{(k)} < \dots < {s'}_{2^k}^{(k)} = t'_j$$

by

$${s'}_l^{(k)} = t'_{j-1} + l\cdot\frac{h}{2^k}, \quad 0\le l\le 2^k.$$

Let us define

$$s_l^{(k)} = F^{-1}\big({s'}_l^{(k)}\big).$$

We now have partitions of $[t_{j-1},t_j]$,

$$t_{j-1} = s_0^{(k)} < s_1^{(k)} < \dots < s_{2^k}^{(k)} = t_j.$$



For convenience, we also consider the points 



M _ 17-1 / ^ 



and the points 



1 - ^ I S-i 



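For any $t\in[t_{j-1},t_j)$ and $k\ge 0$ we define the index

$$l(k,t) = \max\{l : s_l^{(k)} \le t\}.$$

In this way we obtain a chain

$$t_{j-1} = s^{(0)}_{l(0,t)} \le s^{(1)}_{l(1,t)} \le \dots \le s^{(K)}_{l(K,t)} \le t$$

linking the left endpoint $t_{j-1}$ to $t$. Note that for $t\in[t_{j-1},t_j)$ we have by definition $U_n^{(m)}(t) = U_n^{(m)}(t_{j-1})$. We define the functions $\psi_l^{(k)}$, $k\ge 0$, $0\le l\le 2^k$ by

$$\psi_l^{(k)}(x) = \varphi\Big(\frac{x}{s_l^{(k)} - s_{l-1}^{(k)}}\Big)$$

where $\varphi$ is defined as in (2.1). Note that $\psi^{(0)}_{l(0,t)}(x - s^{(0)}_{l(0,t)}) = \varphi_j(x)$. To be consistent, in the case $j = 1$, we have to fix $\psi_0^{(k)} \equiv 0$, for all $k\ge 0$. We build a chain bridging the gap between

$$F_n^{(m)}(t) = \frac{1}{n}\sum_{i=1}^{n}\varphi_j(X_i)$$

and

$$F_n(t) = \frac{1}{n}\sum_{i=1}^{n}1_{(-\infty,t]}(X_i)$$

by the functions

$$\varphi_j(x) = \psi^{(0)}_{l(0,t)}\big(x - s^{(0)}_{l(0,t)}\big) \le \psi^{(1)}_{l(1,t)}\big(x - s^{(1)}_{l(1,t)}\big) \le \dots \le \psi^{(K)}_{l(K,t)}\big(x - s^{(K)}_{l(K,t)}\big) \le 1_{(-\infty,t]}(x) \le \psi^{(K)}_{l(K,t)+2}\big(x - s^{(K)}_{l(K,t)+2}\big),$$

where $K$ is some integer to be chosen later. In this way we get

$$F_n(t) - F_n^{(m)}(t) = \sum_{k=1}^{K}\frac{1}{n}\sum_{i=1}^{n}\Big(\psi^{(k)}_{l(k,t)}\big(X_i - s^{(k)}_{l(k,t)}\big) - \psi^{(k-1)}_{l(k-1,t)}\big(X_i - s^{(k-1)}_{l(k-1,t)}\big)\Big) + \frac{1}{n}\sum_{i=1}^{n}\Big(1_{(-\infty,t]}(X_i) - \psi^{(K)}_{l(K,t)}\big(X_i - s^{(K)}_{l(K,t)}\big)\Big). \qquad (2.3)$$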

Observe that by definition of s^'^^^^ and of 



< l(-oo,t](X,)-4^),)(X,-.|-;,)) 



(^) 



l(K,ty 



SK) ^ 
^l{K,t)) 



From (2.3) we get by centering and normalization



K 



Un{t)-Ut\t) 



k=l ^ i=l 



(fc) ^ 



(k-l) X 



^l(k~l,t)' 



1 " 
^ 1=1 



For the last term on the r.h.s. we have the following upper and lower bounds, 
1 



^ i=l 



^ i=l 



SK) 



(K) ^ 



n ( Elf) 



(K) 

l{K,t)+2 



l{K,t)+2> 



and 



1 " 

-7^ E { (l(-oo,.](X.) - F{t)) - - 4§^,^) - E^fl^iX, - sfl^)) } 

* i=l 

>-^[F{t)-Ei^\^l^^{Xi-s\ 



SK) ^ 
^l{K,t)' 



Now choose K = A + 



log 



log-^(2) 



and note that 



e ^ h e 



and thus 



< Vn 

e 

< -. 



Thus we get for all t e [tj-i, tj], 
\U4t)-Ut\t)\ 



k=l 



(k) ■ 
l{k,t). 



i=l 



n 

E { (W(K,t)+2(^» ~ ^l{K,t)+2) ~ ^'^l{K,t)+2i-^i ~ ^'i{K,t)+2)) 



+ 2- 

Note that by definition of $l(k,t)$ and of $s_l^{(k)}$ we have $s^{(k-1)}_{l(k-1,t)} \in \{s^{(k)}_{l(k,t)},\ s^{(k)}_{l(k,t)-1}\}$,
thus

i(k-i,t)- 

Therefore 

sup \Un{t) - Ut\t)\ 
tj—i^t^tj 

K 



1 

< > —p= max 

V?T' 0<K2'=-l 

fe=i ^ — 



5:((W->(x,-.;->)-£;#i(Xi-.j")) 

n 

E((W5(^<-45)-£W5(A-.-.S)) 



max 



i=l 



V^l 0<i<2^-l 



e 

+2- 



Now 



take £fe := ^^{^+1) ^^^^ Sf=i ^fc ^ |- 
Then we obtain



sup 



\Un{t)-u^r\t)\> 



1=0 ^ V 

At this point we use Markov's inequality together with the 4-th moment bound



E{(*ffi(-'f.-»l5)-£^W5(A-.-»!5: 

j=i 



P\ sup \Ur,{t)-Ui^\t)\>e 



k=l 1=0 ^ k 

■log" (1 + 



< 



+ - 

4 



Ipl [Xo - Si ) - 1p^^^ [Xq - S^^j ) 



. log^ 1 



,{k) ,(k~l) 



(A')^ 



log" 1 



+ - 



.log''(l + ||C-W"'||)}. 



Note that 



V^i (^Ao - Si ) - [Ao - S^^j ) 



< 



< 



3h 
2^ 
and



< 



3h 



If (1.3) is satisfied,



(k) 



< 1 + 

< 1 + 



inf <^ s > : Vt, F{t + s) - F{t) > 



2k 



1 -1 



inf <^ s > : D\\og{s)\-^ > 



h 
2^ 



1 + exp 



D2'' \ ^ 



n -1 



Thus we have 



P{ sup \Un{t) ~ Unitj)\ > e 

. tj-i<t<tj 



< 



< 



K 



k=l 



2k 



h 



k=l ^ 



h 



1 1 3/i 



+4^^2----log"|2 + exp 
+4'^2^^|i^log'|2 + exp 



< 7?k^h{^—] +—ry -^h 

- ne^ ^ \ h 

k=l ^ ' k=l 



D2^ 



n e"* \ h 
h C" I x/n\'< 



k=l 



k=l 



where C and C" are some constants and we have used convergence of the series 
Finally, using $mh = 1$,



pf sup \U^{t)-Ut\t)\>e] 

\0<i<l / 



< 5^p sup \u4t) - u!r\t)\ > E 

\tj-l<t<tj 

_B I C q C 2— — 

< mhn^i ——srK + m — rh ^ 

Now, the first of the two final summands converges to zero as $n\to\infty$. The second can be made arbitrarily small by choosing a partition that is fine enough (i.e. $h$ small). □

We used a different technique than the usual finite-dimensional convergence plus tightness. Of course, since weak convergence implies finite-dimensional convergence and tightness, these two properties are satisfied. Nevertheless, we can also deduce a tightness criterion implying that, almost surely, the limit process has continuous sample paths (see Billingsley [2], Theorem 15.5).

Proposition 2.3 For all $\varepsilon,\eta > 0$, there exist $\delta > 0$ and $N > 0$ such that for all $n\ge N$,

$$P\Big(\sup_{|t-s|\le\delta}|U_n(t) - U_n(s)| > \varepsilon\Big) \le \eta.$$

In particular, $P(W\in C(\mathbb{R})) = 1$.
Proof. 

Let $\varepsilon > 0$ and $\eta > 0$. Let $m$ be an integer such that

C 7] , 

< I (2.4) 



and consider the regular partition of $[0,1]$ with mesh $\frac{1}{m}$.

By Proposition 2.2 there exists $N > 0$ such that for all $n\ge N$,

pfsup \U^{t)-Ut\t)\>'-)<l. 

\0<t<l o/ 4 
Let $\delta > 0$ such that $\delta <$ —. Then, for all $n\ge N$,



P\ sup \Un{t)-Un{s)\>e 

At-s\<5 



<2P{ sup \Un{t) - [/r(t)| > ^"j + P f sup |f/r(t) - > 

Vo<t<l oj \\t-s\<5 



<l+p\ sup i/7r(t)-t/r(^)i>| 

We recall, as tj = F-\t'j) = F-\J^), that 



|(^,-(Xo)-<^,-+i(Xo)||i < P(V2 <Xo <t,) < -, 
11^,11 < l + expM^ 



Thus, by the 4-th moment bound (1.2),
Now there exists N' > N such that 



rae^ \m J 4 

Finally, by (2.4),

$$P\Big(\sup_{|t-s|\le\delta}|U_n(t) - U_n(s)| > \varepsilon\Big) \le \eta.$$



□ 



2.2 The unbounded case

Let $(X_i)_{i\ge 0}$ be an $\mathbb{R}$-valued stationary ergodic random process such that (1.1), (1.2) and (1.3) hold. We will show that this case can be reduced to the case of bounded variables.

For all $x < y\in\mathbb{R}$, we say that the closed interval $[x,y]$ is a 'bad' interval (for $F$) if

$$F(y) - F(x) \ge y - x.$$

We say that $[x,y]$ is a maximal 'bad' interval (for $F$) if for all 'bad' intervals $[a,b]$, we have $[a,b]\subset[x,y]$ or $[a,b]\cap[x,y] = \emptyset$.

We denote by $I^{max}$ the set of all maximal 'bad' intervals.
Lemma 2.4

(i) The Lebesgue measure of

$$I := \bigcup_{[x,y]\in I^{max}}[x,y]$$

is smaller than 1.
(ii) For all $[x,y]\in I^{max}$, we have

$$F(y) - F(x) = y - x.$$

Proof.

Because $F$ is non-decreasing and takes values in $[0,1]$, the first assertion is clear. If for $x < y$, $F(y) - F(x) > y - x$, then there exists $\varepsilon > 0$ such that

$$F(y) - F(x) \ge y - x + \varepsilon.$$

Thus, for all $z > y$ such that $z - y\le\varepsilon$, by monotonicity of $F$, we have

$$F(z) - F(x) \ge F(y) - F(x) \ge y - x + \varepsilon \ge z - x$$

and then $[x,y]$ is not maximal. □
We define the function $g$ from $\mathbb{R}$ to $]0,1[$ by

$$\text{for all } [x,y]\in I^{max},\ \text{for all } t\in[x,y],\quad g(t) := F(x) + t - x$$

and

$$\text{for all } t\notin I,\quad g(t) := F(t).$$

Then $g$ is a 1-Lipschitz function.

We define the $[0,1]$-valued stationary ergodic random process $(Y_i)_{i\ge 0}$ by

$$Y_i = g(X_i), \quad i\ge 0.$$

Since $g$ is Lipschitz, $(Y_i)_{i\ge 0}$ satisfies (1.1) and (1.2).
We also have

$$G(t) := P(Y_0\le t) = F\circ g^{-1}(t)$$

where

$$g^{-1}(t) = \sup\{s\in\mathbb{R} : g(s)\le t\}.$$

Clearly, $G$ is the identity on $g(\mathbb{R}\setminus I)$. Further, for all $[x,y]\in I^{max}$, the graph of $G$ on $g([x,y])$ is the graph of $F$ on $[x,y]$ and the Lebesgue measure of $g([x,y])$ is equal to the Lebesgue measure of $[x,y]$. Then

$$\omega_G(\delta) \le \max\{\omega_F(\delta), \delta\}$$

and (1.3) holds.

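We define the associated distribution functions and empirical processes

$$F_n(t) := \frac{1}{n}\sum_{i=1}^{n}1_{(-\infty,t]}(X_i), \quad t\in\mathbb{R},$$

$$U_n(t) := \sqrt{n}\,(F_n(t) - F(t)), \quad t\in\mathbb{R},$$

$$G_n(t) := \frac{1}{n}\sum_{i=1}^{n}1_{[0,t]}(Y_i), \quad 0\le t\le 1,$$

$$V_n(t) := \sqrt{n}\,(G_n(t) - G(t)), \quad 0\le t\le 1.$$

We have

$$U_n(t) = V_n(g(t)), \quad t\in\mathbb{R}.$$

By the theorem for bounded variables (Section 2.1),

$$(V_n(t))_{0\le t\le 1} \xrightarrow{\mathcal{D}} (V(t))_{0\le t\le 1},$$

where $V(t)$ is a mean-zero Gaussian process such that $P(V\in C[0,1]) = 1$.
Applying Theorem 5.1 of Billingsley [2] with

$$h : D[0,1]\to D(\mathbb{R}), \quad x\mapsto x\circ g$$

we get the weak convergence of $(U_n(t))_{t\in\mathbb{R}}$ to a Gaussian process

$$(W(t))_{t\in\mathbb{R}} = (V\circ g(t))_{t\in\mathbb{R}}$$

such that $P(W\in C(\mathbb{R})) = 1$.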



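3 Proof of Theorem 2

Lemma 3.1 Let $(\mathcal{X},d)$ be a complete metric space and let $x_n, x_n^{(m)}, x^{(m)}\in\mathcal{X}$, $n\ge 1$, $m\ge 1$ be given with the properties

$$\lim_{n\to\infty} d(x_n^{(m)}, x^{(m)}) = 0 \quad \forall m, \qquad (3.1)$$

$$\lim_{m\to\infty}\limsup_{n\to\infty} d(x_n, x_n^{(m)}) = 0. \qquad (3.2)$$

Then $x := \lim_{m\to\infty} x^{(m)}$ exists and

$$\lim_{n\to\infty} d(x_n, x) = 0.$$
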

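Proof. We will first show that $(x^{(m)})_{m\ge 1}$ is a Cauchy sequence. Given $\varepsilon > 0$, choose $M$ so big that for all $m\ge M$

$$\limsup_{n\to\infty} d(x_n, x_n^{(m)}) \le \frac{\varepsilon}{4}.$$

Now take $m_1, m_2\ge M$. For all $n$ sufficiently large, we have then, for $i = 1, 2$,

$$d(x_n, x_n^{(m_i)}) \le \frac{\varepsilon}{3},$$

$$d(x_n^{(m_i)}, x^{(m_i)}) \le \frac{\varepsilon}{6},$$

and hence, by the triangle inequality, $d(x^{(m_1)}, x^{(m_2)}) \le \varepsilon$. Thus $(x^{(m)})_{m\ge 1}$ is a Cauchy sequence and hence $x := \lim_{m\to\infty} x^{(m)}$ exists.

It remains to show that $\lim_{n\to\infty} x_n = x$. Given $\varepsilon > 0$, choose $m_0$ so that

$$\limsup_{n\to\infty} d(x_n, x_n^{(m_0)}) \le \frac{\varepsilon}{4}$$

and $d(x^{(m_0)}, x) \le \frac{\varepsilon}{4}$. Then choose $N$ such that for all $n\ge N$

$$d(x_n, x_n^{(m_0)}) \le \frac{\varepsilon}{2} \quad\text{and}\quad d(x_n^{(m_0)}, x^{(m_0)}) \le \frac{\varepsilon}{4}.$$

Using the triangle inequality, we get

$$d(x_n, x) \le \varepsilon$$

for all $n\ge N$. □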

Proof of Theorem 2. Let $\mu_n$, $\mu_n^{(m)}$ and $\mu^{(m)}$ denote the distributions of the random variables $X_n$, $X_n^{(m)}$ and $X^{(m)}$ respectively. These are elements of $M_1(S)$, the space of probability measures on $S$. We consider the Prohorov metric $d$ on $M_1(S)$, defined by

$$d(\mu,\nu) = \inf\{\varepsilon > 0 : \mu(A) \le \nu(A^{\varepsilon}) + \varepsilon\ \ \forall A\subset S \text{ measurable}\}.$$

Note that $(M_1(S), d)$ is a complete metric space. If $Y$, $Z$ are two $S$-valued random variables with distributions $P_Y$, $P_Z$, satisfying

$$P(\rho(Y,Z) \ge \varepsilon) \le \varepsilon,$$

then $d(P_Y, P_Z) \le \varepsilon$. Moreover $d$ metrizes the topology of weak convergence, i.e. $\mu_n \xrightarrow{\mathcal{D}} \mu$ if and only if $d(\mu_n,\mu)\to 0$. We now apply Lemma 3.1 to $\mu_n$, $\mu_n^{(m)}$, $\mu^{(m)}$. Note that (3.1) is a direct consequence of (1.4). Given $\varepsilon > 0$, by (1.5) we can find $m_0$ such that for all $m\ge m_0$,

$$\limsup_{n\to\infty} P(\rho(X_n, X_n^{(m)}) \ge \varepsilon) < \varepsilon.$$

Fix such an $m$; then we can find $n_0$ such that for all $n\ge n_0$

$$P(\rho(X_n, X_n^{(m)}) \ge \varepsilon) \le \varepsilon$$

and thus $d(\mu_n, \mu_n^{(m)}) \le \varepsilon$. Hence

$$\limsup_{n\to\infty} d(\mu_n, \mu_n^{(m)}) \le \varepsilon$$

for all $m\ge m_0$, showing that (3.2) holds. Thus by Lemma 3.1, there exists a probability distribution $\mu$ on $S$ such that

$$\lim_{m\to\infty} d(\mu^{(m)}, \mu) = 0,$$

$$\lim_{n\to\infty} d(\mu_n, \mu) = 0.$$

Finally, let $X$ be an $S$-valued random variable with distribution $\mu$. Then $X^{(m)} \xrightarrow{\mathcal{D}} X$ as $m\to\infty$ and $X_n \xrightarrow{\mathcal{D}} X$ as $n\to\infty$. □



4 Examples

According to Durieu [13] the 4-th moment bound (1.2) holds for Markov chains and dynamical systems under some assumptions on the Markov transition operator or the Perron-Frobenius operator.

Let $(E,d)$ be a separable metric space and $(X_k)_{k\ge 0}$ be an $E$-valued Markov chain with transition operator $Q$ and invariant measure $\nu$. Denote by $\mathcal{L}$ the space of all bounded Lipschitz continuous functions from $E$ to $\mathbb{R}$ equipped with the norm defined in (1.2). We say that the Markov chain $(X_k)_{k\ge 0}$ is $\mathcal{L}$-geometrically ergodic if there exist $C > 0$ and $0 < \theta < 1$ such that for all $f\in\mathcal{L}$,

$$\|Q^n f - \Pi f\| \le C\theta^n\|f\|, \qquad (4.1)$$

where $\Pi f = E_{\nu} f(X_0)$. This condition corresponds to the fact that the Markov operator is quasi-compact on the space $\mathcal{L}$ with 1 as only eigenvalue of modulus one and simple (see Hennion and Herve [17]). Since $\mathcal{L}\subset L^{\infty}$, the following result is a special case of Corollary 2 of Durieu [13].

Proposition 4.1 If $(X_n)_{n\ge 0}$ is an $\mathcal{L}$-geometrically ergodic Markov chain then (1.2) holds for all $f\in\mathcal{L}$ such that $Ef(X_0) = 0$, with $\alpha = 3$ and $\beta = 2$.

The same is true for dynamical systems whose Perron-Frobenius operators satisfy (4.1). This gives a large class of examples where our result applies.
Linear processes

Let $(A, \|\cdot\|_A)$ be a separable Banach space and $\mathcal{A}$ its Borel sigma algebra. Let $(a_i)_{i\ge 0}$ be a sequence of linear forms on $A$ such that there exist $C > 0$ and $0 < \theta < 1$ such that

$$|a_i| \le C\theta^i, \qquad (4.2)$$

where $|a_i| = \sup_{\|x\|_A\le 1}|a_i(x)|$. Let $(e_i)_{i\in\mathbb{Z}}$ be an i.i.d. bounded random sequence with values in a compact subset $B\in\mathcal{A}$ and marginal distribution $\mu$. We define the real-valued linear process $(X_k)_{k\ge 0}$ by

$$X_k = \sum_{i\ge 0} a_i(e_{k-i}), \quad k\ge 0.$$

Several results have already been established for empirical processes of linear processes (see Doukhan and Surgailis [11], Wu [21], Dedecker and Prieur [6]). Here, the assumption on the $(a_i)_{i\ge 0}$ is stronger than in the mentioned papers, but there will be no assumption on the distribution of the $e_i$'s and the assumption on the distribution function of $X_0$ will be weaker. Note that $(X_k)_{k\ge 0}$ can be viewed as a functional of a Markov chain.

Let $Y_k = (e_k, e_{k-1}, \dots)$, then $(Y_k)_{k\ge 0}$ is a stationary Markov chain on $B^{\mathbb{N}}$ (with stationary measure $\mu^{\otimes\mathbb{N}}$) and $X_k = \Phi(Y_k)$ where

$$\Phi : B^{\mathbb{N}}\to\mathbb{R}, \quad \Phi(x_0, x_1, \dots) = \sum_{i\ge 0} a_i(x_i).$$

Let $Q$ be the Markov transition operator of the chain. On $B^{\mathbb{N}}$, we define a metric $d$ by

$$d(x,y) = \sum_{i\ge 0}\theta^i\|x_i - y_i\|_A$$

where $x = (x_i)_{i\ge 0}$ and $y = (y_i)_{i\ge 0}$. As $B$ is compact, $(B^{\mathbb{N}}, d)$ is also compact. Let us denote by $\mathcal{L}$ the space of all Lipschitz functions from $B^{\mathbb{N}}$ to $\mathbb{R}$ provided with the norm $\|\cdot\|$ defined by

$$\|f\| = \sup_{x\in B^{\mathbb{N}}}|f(x)| + \sup_{x\ne y}\frac{|f(x)-f(y)|}{d(x,y)}.$$

For all $f\in\mathcal{L}$ and for all $x = (x_i)_{i\ge 0}$ and $y = (y_i)_{i\ge 0}\in B^{\mathbb{N}}$, we have

$$|Q^k f(x) - Q^k f(y)| = |E(f(Y_k)\,|\,Y_0 = x) - E(f(Y_k)\,|\,Y_0 = y)|$$
$$= |E(f(e_k,\dots,e_1,x_0,\dots)) - E(f(e_k,\dots,e_1,y_0,\dots))|$$
$$\le \|f\|\,E\{d((e_k,\dots,e_1,x_0,\dots),(e_k,\dots,e_1,y_0,\dots))\}$$
$$\le C\theta^k\|f\|\,d(x,y),$$

and

$$|Q^k f(x) - Ef(Y_0)| = |E(f(Y_k)\,|\,Y_0 = x) - Ef(Y_k)|$$
$$\le E|f(e_k,e_{k-1},\dots,e_1,x_0,\dots) - f(e_k,e_{k-1},\dots)|$$
$$\le C\theta^k\|f\|\,E\{d(x,Y_0)\}.$$

Then, we have for all $f\in\mathcal{L}$,

$$\|Q^k f - E(f(Y_0))\| \le C\theta^k\|f\|.$$

Since $(\mathcal{L},\|\cdot\|)\subset(L^{\infty}(\mu^{\otimes\mathbb{N}}),\|\cdot\|_{\infty})$, by Proposition 4.1 $(Y_k)_{k\ge 0}$ satisfies the 4-th moment bound (1.2) with $\alpha = 3$ and $\beta = 2$ for all Lipschitz functions. Further, for all $f\in\mathcal{L}$ the sequence $\sum_{k=0}^{n}Q^k f(Y_0)$ converges in $L^2(\mu^{\otimes\mathbb{N}})$ and so by Gordin's theorem (see Gordin [15]), the CLT (1.1) is satisfied. Clearly, the function $\Phi$ is a Lipschitz continuous function on $B^{\mathbb{N}}$, and for all Lipschitz functions $g : \mathbb{R}\to\mathbb{R}$, $g\circ\Phi$ is also a Lipschitz continuous function on $B^{\mathbb{N}}$. Thus conditions (1.1) and (1.2) hold for the process $(X_k)_{k\ge 0}$, for all Lipschitz functions on $\mathbb{R}$. Then Theorem 1 applies and we have

Corollary 4.2 Let $(X_k)_{k\ge 0}$ be a real linear process defined by a sequence of linear forms $(a_i)_{i\ge 0}$ and a sequence of i.i.d. bounded random variables $(e_i)_{i\in\mathbb{Z}}$, both on a measurable Banach space $A$. Assume $(a_i)$ satisfies (4.2) and the distribution function $F$ of $X_0$ satisfies

$$\omega_F(\delta) \le D\,|\log(\delta)|^{-\gamma} \quad \text{for some } D > 0 \text{ and } \gamma > 2.$$

Then $(U_n(t))_{t\in\mathbb{R}}$ converges in distribution to a mean-zero Gaussian process.

In the paper by Dedecker and Prieur [6], Corollary 1, $X_0$ has a bounded density. Here, the existence of a density is not needed. Our result is comparable to a result of Wu and Shao [22].

For a concrete example, consider $A = \{0,1\}$, $a_i = 2\cdot 3^{-(i+1)}$, $i\ge 0$ and $e_i = 0$ or $1$ with probability $\frac{1}{2}$, $i\in\mathbb{Z}$. Then

$$X_k = 2\sum_{i\ge 0} 3^{-(i+1)}\,e_{k-i}, \quad k\ge 0$$

is a stationary process with values in $[0,1]$ and the common distribution function of all the $X_k$ is the Cantor function, which is not absolutely continuous but $\frac{\log 2}{\log 3}$-Hölder continuous (see Dovgoshey et al. [12]).
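
The Cantor example lends itself to a quick simulation. The sketch below assumes the series representation $X_k = 2\sum_{i\ge 0}3^{-(i+1)}e_{k-i}$ with i.i.d. Bernoulli(1/2) innovations (the standard representation of the Cantor distribution), truncates the series after `terms` summands, and compares the empirical distribution function with the Cantor function. All function names and the truncation are assumptions made for illustration only.

```python
import random


def cantor_linear_process(n, rng, terms=30):
    """X_k = 2 * sum_{i < terms} 3^{-(i+1)} e_{k-i}, e_i i.i.d. Bernoulli(1/2)."""
    e = [rng.randint(0, 1) for _ in range(n + terms)]
    weights = [2.0 * 3.0 ** -(i + 1) for i in range(terms)]
    # e[k + terms - 1 - i] plays the role of e_{k-i}.
    return [
        sum(w * e[k + terms - 1 - i] for i, w in enumerate(weights))
        for k in range(n)
    ]


def cantor_function(x, depth=40):
    """Cantor function on [0, 1], computed from the ternary expansion of x."""
    if x <= 0.0:
        return 0.0
    if x >= 1.0:
        return 1.0
    third, y, scale = 1.0 / 3.0, 0.0, 0.5
    for _ in range(depth):
        if x < third:
            x = 3.0 * x
        elif x > 2.0 * third:
            y += scale
            x = 3.0 * x - 2.0
        else:
            return y + scale  # x lies in a removed middle third (plateau)
        scale /= 2.0
    return y


def sup_distance(sample, G):
    """sup_t |F_n(t) - G(t)| for a continuous distribution function G."""
    xs = sorted(sample)
    n = len(xs)
    return max(
        max((i + 1) / n - G(x), G(x) - i / n) for i, x in enumerate(xs)
    )


if __name__ == "__main__":
    rng = random.Random(1)
    xs = cantor_linear_process(2000, rng)
    print(sup_distance(xs, cantor_function))
```

Multiplying the printed distance by $\sqrt{n}$ gives one realization of $\sup_t|U_n(t)|$ for this process, up to the truncation error of the series.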

Expanding maps

In the setting of expanding maps of the interval, empirical process invariance principles have been established in Collet, Martinez and Schmitt [5] and Dedecker and Prieur [6] for classes of Lasota-Yorke transformations. For these maps, the transfer operator has a spectral gap on the space BV of bounded variation functions. According to Gouezel [16], there exist some uniformly expanding maps of the interval for which the transfer operator does not act continuously on the space BV, but admits a spectral gap on the space of Lipschitz functions. The example given by Gouezel is a transformation of the interval $[0,1)$. Let $(a_n)_{n\ge 1}$ be a sequence of positive numbers with $\sum a_n < \frac{1}{4}$ and let $N > 0$ be an integer. Denote by $I_n$ the subintervals $[4\sum_{i=1}^{n-1}a_i,\ 4\sum_{i=1}^{n}a_i)$. We decompose $I_n$ into two subintervals of length $2a_n$ denoted by $I_n^{(1)}$ and $I_n^{(2)}$. We can find a map $v_n$ (resp. $w_n$) on $[0,1)$ with image $I_n^{(1)}$ (resp. $I_n^{(2)}$) such that the derivative at a point $x$ is equal to $a_n(1 + 2\cos^2(2\pi n^N x))$ (resp. $a_n(1 + 2\sin^2(2\pi n^N x))$). The map $T$ is defined on $I_n$ in such a way that $v_n$ and $w_n$ are two inverse branches of it. There remains the interval $[4\sum_{i\ge 1}a_i,\ 1)$, which we subdivide into subintervals of equal length. $T$ is defined as an affine transformation on each of these subintervals onto $[0,1)$.

Theorem (Gouezel |16] ) = iqI^^ o,nd N = A, then T is a Lebesgue measure 
preserving transformation and its associated transfer operator has a spectral gap on 
the space of Lipschitz functions with a simple eigenvalue at 1 and no other eigenvalue 
of modulus 1. Further, the transfer operator does not act continuously on BV. 

In this situation, the 4-th moment bound (1.2) holds and Theorem 1 can be used
to get an invariance principle for the associated empirical process. 

Further applications

Durieu [13] has also given 4-th moment bounds for subshifts of finite type, using the Ruelle-Perron-Frobenius theorem, as in Parry and Pollicott [18]. Our result thus also applies here.

Another application concerns random iterative Lipschitz models and, as a special case, nonlinear autoregressive models $(X_n)_{n\ge 0}$, defined as follows. For a real-valued random variable $X_0$ and a given function $f : \mathbb{R}\to\mathbb{R}$, let

$$X_n = f(X_{n-1}) + Y_n, \quad n\ge 1,$$

where $(Y_n)_{n\ge 1}$ is an i.i.d. sequence of $\mathbb{R}$-valued random variables independent of $X_0$. Such models are studied, e.g., in nonlinear time series analysis. See Hennion and Herve [17], Thm. X.16, for conditions under which $(X_n)_{n\ge 0}$ is $\mathcal{L}$-geometrically ergodic.
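
A nonlinear autoregressive model of this form is straightforward to simulate. The following minimal sketch is illustrative only: the function name `simulate_autoregression`, the contraction $f(x) = \tfrac12\tanh(x)$ and the Gaussian innovations are hypothetical choices, and the ergodicity conditions of Hennion and Herve are not checked by the code.

```python
import math
import random


def simulate_autoregression(f, x0, noise):
    """Return [X_0, X_1, ..., X_n] with X_k = f(X_{k-1}) + Y_k.

    `noise` is the list [Y_1, ..., Y_n] of innovations.
    """
    path = [x0]
    for y in noise:
        path.append(f(path[-1]) + y)
    return path


if __name__ == "__main__":
    rng = random.Random(0)
    innovations = [rng.gauss(0.0, 1.0) for _ in range(10)]
    # Hypothetical example: f is Lipschitz with constant 1/2.
    path = simulate_autoregression(lambda x: 0.5 * math.tanh(x), 0.0, innovations)
    print([round(x, 3) for x in path])
```

The resulting path can be fed into any empirical distribution function routine to form the empirical process studied in this paper.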